Context-aware Language Modeling for Conversational Speech Translation
نویسندگان
چکیده
Context plays a critical role in the understanding of language, especially conversational speech. However, few approaches exist to utilize the external contextual knowledge which is readily available to practical speech translation systems deployed in the field. In this work, we propose a novel framework to integrate context in the language models used for conversational speech translation. The proposed approach takes into account the contextual distance between a test utterance and the training corpus, a measure obtained from the external context in which the utterances were spoken. Language model probabilities are adjusted through a sentence-level weighting scheme based on this context-distance measure. When incorporated into our English-Iraqi Arabic speech-to-speech translation system, the proposed approach obtains improvements in both speech recognition accuracy and translation quality compared to the baseline system.
منابع مشابه
Source-Error Aware Phrase-Based Decoding for Robust Conversational Spoken Language Translation
Spoken language translation (SLT) systems typically follow a pipeline architecture, in which the best automatic speech recognition (ASR) hypothesis of an input utterance is fed into a statistical machine translation (SMT) system. Conversational speech often generates unrecoverable ASR errors owing to its rich vocabulary (e.g. out-of-vocabulary (OOV) named entities). In this paper, we study the ...
متن کاملComedy, Context and Unsaid Meaning: A Case Study in Conversational Implicature
Pragmatics moves away from the word level and sentence level study of language towards the study of language in real-world context and at discourse level whereby two or more participants take part in conversation. There are moments when the speaker explicitly says something but the listener may have other interpretations and inferences from their statements. The aim of this study was to demonst...
متن کاملA language model for conversational speech recognition using information designed for speech translation
In this paper, a new language model is proposed for speech recognition in conversational speech translation. In conversation, speech strongly depends on the previous utterance of the other participant. Applying this dependency in language modeling, we can reduce the speech recognition error rate. To this end, we propose the following new language model where the content of the previous utteranc...
متن کاملA Dual Encoder Sequence to Sequence Model for Open-Domain Dialogue Modeling
Ever since the successful application of sequence to sequence learning for neural machine translation systems (Sutskever et al., 2014), interest has surged in its applicability towards language generation in other problem domains. Recent work has investigated the use of these neural architectures towards modeling open-domain conversational dialogue, where it has been found that although these m...
متن کاملInteractive Translation of Conversational Speech
We present JANUS-II, a large scale system effort aimed at interactive spoken language translation. JANUS-II now accepts spontaneous conversational speech in a limited domain in English, German or Spanish and produces output in German, English, Spanish, Japanese and Korean. The challenges of coarticulated, disfluent, ill-formed speech are manifold, and have required advances in acoustic modeling...
متن کامل